Computational complexity arising from degree correlations in networks 
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We apply a Bethe-Peierls approach to statistical-mechanics models defined on random networks of 
arbitrary degree distribution and arbitrary correlations between the degrees of neighboring vertices. 
Using the NP-hard optimization problem of finding minimal vertex covers on these graphs, we 
show that such correlations may lead to a qualitatively different solution structure as compared to 
uncorrelated networks. This results in a higher complexity of the network in a computational sense: 
Simple heuristic algorithms fail to find a minimal vertex cover in the highly correlated case, whereas 
uncorrelated networks seem to be simple from the point of view of combinatorial optimization. 
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The last few years have seen a great advance in the 
study of complex networks where the term complex 
refers to the existence of one or more of the following 
properties: small world effect || , power-law degree distri- 
bution jjj , and more recently also correlations || ||, ||, |) . 
On the other hand, if we focus on the solution of a given 
task on top of these networks, the term complex is better 
associated with the time required to solve it, i.e. with its 
computational complexity In this context, a problem 
is complex if its algorithmic solution time is growing ex- 
ponentially in the network size. At the core of complex 
optimization problems one finds the NP-hard class 0, 
where NP stands for non-deterministic polynomial time. 

In the case of uncorrelated networks with power law 
degree distributions we can take profit of the existence 
of hubs to solve different problems, like destroying the 
giant component ||, preventing epidemic outbreaks 
and searching ]l0| ] . The extremely inhomogeneous struc- 
ture of uncorrelated networks can also be exploited to 
approximate or even to solve instances of NP-hard prob- 
lems using heuristic algorithms running in polynomial 
time. However, the influence of properties like degree 
correlations or clustering is not clear yet. Recent studies 
of percolation j(| and disease spreading jnj have shown 
that degree correlations can quantitatively change, e.g., 
the transition threshold but qualitatively the results are 
similar to those obtained for uncorrelated networks. 

This changes drastically if we consider hard optimiza- 
tion tasks defined over correlated networks. In this work 
we study the influence of degree correlations on the com- 
putational complexity, and in a more general perspective 
the relation between the topology of complex networks 
and the computational complexity of hard problems de- 
fined on top of them. For this purpose we generalize the 
Bethe-Peierls approach to statistical-mechanics models 
defined on networks with an arbitrary degree distribu- 
tion and arbitrary degree correlations of adjacent nodes. 

The approach is applied to characterize the minimal 
vertex covers on these graphs. We have chosen this prob- 



lem for two reasons: It belongs to the basic NP-hard op- 
timization problems over graphs Q], and has found ap- 
plications in monitoring Internet traffic fl^| and denial of 
service attack prevention |Ts)| . Our analytical results are 
later compared with an approximate solution obtained 
using a heuristic algorithm. This heuristic fails to find 
minimal vertex covers in the strongly correlated case, 
whereas networks with low correlations seem to be sim- 
ple from the point of view of combinatorial optimization. 
Within our analytical approach, this change of behavior 
is associated with replica symmetry breaking (RSB). 

Consider the set of undirected graphs with N vertices 
and arbitrary degree distribution pd- Following a ran- 
domly chosen edge, we will find a vertex of degree d + 1 
with probability q e i = (d+ l)pd+i/c, with c denoting the 
average degree. The number d of additional edges will be 
called excess degree. We further assume correlations be- 
tween adjacent vertices: The probability that a randomly 
chosen edge connects two vertices of excess degrees d, d' 
is given by (2 — o~d,d')edd'- The conditional probability 
that a vertex of excess degree d is reached following any 
edge coming from a vertex of excess degree d! , 



p{d\d!) = edd'/qd' 



(1) 



thus explicitly depends on both d and d! . Consistency 
with the degree distribution requires J2d'=o e dd' = Id, 
and edd' has to be symmetric. For uncorrelated graphs 
e-dd' = Qdld' factorizes. The strongest positive correla- 
tions are reached for edd' = qd$dd' where only vertices of 
equal degrees are connected. 

Let us now consider a general statistical-mechanics 
model with discrete degrees of freedom defined on ver- 
tices, and interactions defined on edges. We use a lattice 
gas model described by the Hamiltonian 



(3H = 2J JijW(Xi,Xj) +^^: 



(2) 



i<j 



defined for any microscopic configuration Xi = 0, 1, i — 
l,...,N. J is the adjacency matrix with entries Jij = 1 



2 



if vertices i and j are adjacent, and J„ = else. The 
inverse temperature is denoted by j3, the chemical poten- 
tial by fi. The interactions w(xi,Xj) are arbitrary, thus 
including also the case of the ferromagnetic Ising model, 
w(xi, Xj) — (2xi — l)(2xj — 1). The only disorder present 
in Eq. (j^) is given by the edges Jij. Generalizations to 
disordered interactions, as present e.g. in spin-glasses, 
random local fields or non-binary discrete variables are 
evident. For clarity of the presentation, we restrict our- 
selves to the simple model given above. 

Since the graphs are locally tree-like, the model can be 
solved by the iterative Bethe-Peierls scheme which be- 
comes exact if only one pure state is present. The free 
energy can be expressed in terms of simple effective-field 
distributions acting on vertices of given degree. In the 
case of multiple pure states this has to be generalized to 
the cavity approach, see e.g. [jl5| for the example of a 
spin-glass on a Bethe lattice of constant vertex degree. 
Alternatively, one can apply the replica approach. The 
simple Bethe-Peierls solution corresponds to the assump- 
tion of replica symmetry (RS), whereas the full cavity 
approach is able to handle also the case of RSB. 

Take any edge i.e. Jij = 1. Let us introduce 

Zx as the partition function of the subtree rooted in 
i, with deleted edge and with Xi fixed to the value 

x. This partition function can be calculated iteratively, 



depends on the cavity field 

hi=fl+ ^2 U ( h (k\i)) 



(9) 



fcl Jifc=i 



resulting from the influence of all neighbors on vertex i. 

Let us now assume, that the model has only one pure 
state, which corresponds to the assumption of RS. In this 
case, the iterative procedure given by Eq. (Q) converges 
to well-defined distributions Pd(h) of effective fields huu^ 
restricted to vertices of excess degree d. They are deter- 
mined by the self-consistency equation 



Pd{h) 



JJ [dhi J^p(di\d)P dl (hi 



d,=0 



(10) 



Please note that, in contrast to the uncorrelated case, 
we do need the field distributions for all possible excess 
degrees. In the uncorrelated case, the average of these 
distributions over q d is sufficient. We may also introduce 
the analogous distributions P d (h) of cavity fields hi for 
vertices of given degree d (note: here the full degree is 
relevant), they can be calculated from P d {h) by 



Z, 



m _ 



n (e~Wz™ + e«°>M m ) (3) P d (h) = rf[U± P (dM-l)PM) 



r(i\3) 



The effective fields h (i]j) = ln(zf lj) /Z^ j) ) are thus de- 
termined by the iterative description 

h m =fi+ u(h {m ) . (4) 

where uQiru^) is the effective field induced by Xk on site 
i, and is given by 

/ e w(i,o) + e w(x,i)+h Wi) \ 
u{h ^ ) = h \ eW (o,o) +eW (o^ + k (klt) ) ■ ( 5 ) 

The free energy of the system can be written as 

-PNf = j2 fi + E ifa -fi- fi) 

i i<j 

where the link contribution equals 



(11) 



(G) 



whereas the site contribution 



(7) 



(8) 



Replacing the sum over vertices and edges in Eq. (g) by 
the corresponding averages over P d {h) resp. P d (h), we 
finally find the free-energy density 

/*°° 

Pf = -J2( d ~^Pd / dh P d (h) In (l + e h ) 

OO n QQ 

+| edd ' / dhdh'P d {h)P d ,{h') (12) 

d,d'=o J ~°° 

m ( e ^(0,0) + e w(l,0)+h _|_ e w(0,l)+V + e w(l,l)+h+h'\ 

The simplest application of this approach is given by 
the ferromagnetic Ising model. If we look to the ground- 
states, i.e. to the limit (3 — > oo, we find that, as to 
be expected, the global magnetization is determined by 
the size of the giant component. The existence of a fer- 
romagnetic phase at low temperature is thus related to 
percolation. The latter was already analyzed in Ref. || . 

Another application is given by the vertex cover (VC) 
problem. It belongs to the basic NP-hard optimization 
problems and, therefore, it is expected to require a 
solution time which is growing exponentially with the 
graph size. Let us be more precise. Given a graph with 



3 



vertices I <E {1,...,N} and edges < i < j < 

N, Jij = 1}, a vertex cover V is a subset of vertices, V C 
{1, N}, such that at least one end- vertex of every edge 
is contained in V. So no edge (i, j) is allowed to exist with 
i ^ V and j ^ V. Of course, the set of all vertices forms 
a trivial VC. The hard optimization problem consists in 
finding the minimal VC. 

Using the hard-sphere lattice-gas representation intro- 
duced in pS, where x% = 1 if i £ V, and x, = if i e V, 
the VC condition can be rewritten as 



n (i - EiEj) = 1 

0'.j)|Jii=i 



(13) 



which fits into the above framework by setting e w ^ Xi,Xj ^ = 
1 — XiXj. The chemical potential can be used to fix the 
cardinality of the VC, minimal ones are obtained in the 
limit ji — ► oo. They correspond to maximal packings in 
the lattice-gas picture. To perform the limit fi — > oo all 
fields have to be rescaled as h = fxz Q . We obtain 

/oo d / oo \ 

n <fci£p(*id)p*(2i) 
-°° ;=i V d != o / 

x<5 ^z-l-^max(0,z ; )^ , ( 14 ) 

which is solved by Pd{z) = J2t^i P^^i 2 + 0- This 
ansatz allows for integer valued fields only, we find a sim- 
ple relation including only the p_\ : 



(d) 



All other follow easily. The expression inside the 
parenthesis can be understood as the average probability 
ltd that an edge entering a vertex of degree d + 1 car- 
ries a constraint, i.e. that it is not yet covered by the 
neighboring vertex. It thus fulfills the condition 



f>(di|d)(l 



Adxh 



.di=0 



(15) 



7Td 



X>(<*lW(l-^) dl 



(16) 



d,=0 



Having in mind that, due to the limit p — > oo, every 
vertex with positive z is fixed to x — 1, every one with 
negative z has x — 0, we can immediately read off the 
fraction of vertices belonging to a minimal VC, 



1 



E 

d=0 



Pd(l - 7Td_l) 



d-2 



-7Td-l 



(17) 



Remember that the last expressions are related to the 
validity of RS, i.e. to the existence of a single connected 
cluster of minimal VCs in configuration space. As ob- 
served in [Il6| , RS is related to the local stability of this 



solution. In presence of RSB, Eq. ( |iq ) has no stable so- 
lution. Since it has to be solved by numerical iteration 
in the general case, an instability prevents the program 
from convergence and thus provides a precise tool to de- 
tect RSB without any RSB calculation. 

To see how this works out, we concentrate on networks 
having equal degree distributions but different correla- 
tion properties. We restrict our attention to scale- free 
graphs with pd ~ dr 1 for d = 1, oo, with 7 > 2. For 
vertex cover, interesting effects are expected to appear for 
positive correlations, or assortative networks. We there- 
fore consider 



e-dd' = qd [rS d ,d> + (1 - r)q d > 



(18) 



For < r < 1, this expression linearly interpolates be- 
tween uncorrelated (r = 0) and fully assorted (r = 1) 
networks. Please note that this network is percolated for 
all 7 as soon as r > 0. In Fig. |l| we show the resulting 
size of the minimal VCs for different values of 7 as a func- 
tion of r. The RS solution breaks at a certain value of r 
that depends on 7. There, the solution-space structure 
changes drastically, from being unstructured, or RS, in 
the low correlated case to being clustered, or RSB, for 
sufficiently high correlations. 

To check the consequences of this transition for heuris- 
tic optimization algorithms, we have numerically gen- 
erated scale- free networks with correlations (18), and 



applied a generalization of the leaf-removal heuristic of 
Bauer and Golinelli [17|. For the special case of correla- 
tions given by Eq. (18), the network can be generated 
using a modification of the Molloy-Reed algorithm E3] 
for random graphs of arbitrary degree distribution. First, 
each node i is assigned a degree dk with probability pd t ■ 
Then we create a set C containing di copies of each node 
i. Finally, pairs of nodes are connected according to fol- 
lowing rule: (i) Select a node i in £ at random, (ii) With 
probability r, select a second node j £ C with dj = di in 
C at random; otherwise select an arbitrary node j G C 
at random, (in) Connect i and j and delete both from 
L. This is repeated until C is empty. 

Once the network is generated, we construct a VC us- 
ing a generalized leaf-removal algorithm E^] defined as 
follows: Select a vertex of minimal current degree from 
the network and cover all its neighbors. The considered 
vertices and all incident edges are removed from the net- 
work. This step is iterated untill the full network is re- 
moved. If, for some graph, this algorithm stops without 
having ever chosen vertices of current degree d > 2, the 
constructed VC is minimal ]i"7j ]. Overestimations may 
appear if the algorithm is forced to select also vertices 
of higher degree d > 2, where the error can be at most 
d— 1. Thus, summing (d—l)(l — 5d,o)/N over all iteration 
steps, we get an upper bound Ax on the total error made 
in estimating x c using the above heuristic algorithm. If 
Ax goes to zero in the large- N limit, the algorithm has 
consequently constructed an almost minimal VC. 
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FIG. 1: Minimal VC size for a network with degree distribu- 
tion pd ~ dT 1 and degree correlations given byEq. ©. The 
lines give the analytical solution for 7 = 2.5 (lower curve) and 
7 = 3.0 (upper curve). The curves stop at the point where 
RS breaks. The symbols are numerical estimates for 7 = 2.5 
(circles) and 7 = 3.0 (squares), and network size N = 10 6 . In 
the inset we plot the upper error bound Ax for generalized 
leaf removal. The onset of non-zero error coincides with the 
RSB transition, marked by arrows. 

In Fig. [I] we show the size of VCs found by generalized 
leaf removal as a function of r. Up to the RSB point the 
numerical solutions are close to the analytical values, up 
to finite-size corrections resulting mainly from a degree 
cutoff d m ax ~ N 1 ^ 1 . Beyond the RSB point we still have 
a numerical estimate but we cannot be sure that it is 
optimal. In the inset of Fig. ^ the upper bound on the 
error is displayed. In the RS region we have Ace ~ for 
N 3> 1 and, therefore, the heuristic algorithm asymptot- 
ically yields the exact value x c - However, in the highly 
correlated region we find a finite Ax at any network size, 
thus the heuristic algorithm fails to find almost minimal 
vertex covers. Moreover, the point where Ax becomes 
different from zero coincides with the RSB point. 

It is interesting to know in which phase realistic net- 
works are. VCs have found applications in monitoring the 
Internet traffic jl^] and in denial of service attack pre- 
vention Jl3| . The analysis of Internet maps has revealed 
negative (dissasortative) correlations at the autonomous 
system level [[|. Negative correlations are actually com- 
mon in technological and biological networks ||. Hence, 
the generalized leaf-removal heuristic should output al- 
most optimal VCs in linear time. On the contrary, social 
networks exhibit positive (assortative) correlations |(| . In 
this case VCs can be used to monitor social relations be- 
tween pairs of individuals but, because of the existence 
of positive correlations, simple heuristic algorithms may 
fail to produce near optimal solutions. 

To summarize, we have generalized the Bethe-Peierls 
approach to random networks with degree correlations 
and analyzed the VC problem as a prototype optimiza- 



tion problem defined over graphs. We have found that 
uncorrelated power-law networks are simple from the 
point of view of combinatorial optimization, inhomo- 
geneities of neighboring vertices can be exploited. The 
introduction of sufficiently large degree correlations leads 
to RSB and thus to a failure of simple heuristic algo- 
rithms. For constructing optimal solutions, complete al- 
gorithms including e.g. backtracking have to be used. 
These, however, result in general in exponential solution 
times, and thus in a higher algorithmic complexity. Our 
results point out that optimization problems in many 
technological and biological networks can be simple due 
to the strong degree inhomogeneities and negative corre- 
lations present on them. 

We acknowledge fruitful discussions with M. Leone, A. 
Vespignani and R. Zecchina. 
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